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5 SYSTEMS AND METHODS FOR CONSTRUCTING MULTI- LAYER 

TOPOLOGICAL MODELS OF COMPUTER NETWORKS 

This application claims priority from U.S. Provisional 

Application 60/202,296, entitled "Construction of a Very 
Rich, Multi- layer Topological Model of a Computer Network 

10 for Purposes of Fault Diagnosis,' 1 filed on May 5, 2000; and 
claims priority from U.S. Provisional Application 
60/202,299, entitled "A method for diagnosing faults in 
large multilayered environments guided by path and 
dependency analysis of the modeled system," filed on May 5, 

15 2000; and claims priority from U.S. Provisional Application 
60/202,298, filed on May 5, 2000, entitled "Method and 
apparatus for performing integrated computer network, 
system, and application fault management , 1 1 all of which are 
incorporated by reference in their entireties. 

20 

General Description 
The invention relates to determining topology of 
network elements in a communications network. 
The construction of computer networks' started on a large 

25 scale in the 1970* s. Computer networks link personal 

computers, workstations, servers, storage device^; printers 
and other devices. Historically, wide area computer 
networks (WANs) have enabled communications across large 
geographic areas, and local area networks (LANs) 

30 communications at individual locations. Both WANs and LANs 
have enabled sharing of network applications such as 
electronic mail, file transfer, host access and shared 
databases. Furthermore, WANs and LANs have enabled 
efficient transfer of information, and sharing of resources, 

35 which in turn increased user productivity. Clearly, 

communications networks have become vitally important for 
businesses and individuals. 

Communications networks usually transmit digital data 
in frames or packets created according to predefined 
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protocols that define their format. Data frames include 
headers (located at the beginning and containing addresses) , 
footers {located at the end of the frames) , and data fields 
that include the transmitted data bits (payload) . Data 
5 frames may have a fixed or variable length according to the 
used protocol or network type. 

A communications network transmits data from one end 
station (i.e., a computer, workstation, server etc.) to 
another using a hierarchy of protocol layers (i.e., layers 

10 that are hierarchically stacked) . In the communication 
process, each layer in the source communicates with the 
corresponding layer in the destination in accordance with a 
protocol defining the rules of communication. This is 
actually achieved by transferring information down from one 

15 layer to another across the layer stack, transmitting across 
a communication medium, and then transferring information 
back up the successive protocol layers on the other end. To 
facilitate better understanding, however, one can visualize 
a protocol layer communicating with its counterparts at the 

20 same layer level. 

The open system interconnection (OSI) model has seven 
layers that define the rules for transferring information 
between the stations. A physical layer (Layer 1) is 
responsible for the transmission of bit streams across a 

25 particular physical transmission medium. This layer 
involves a connection between two endpoints allowing 
electrical signals to be exchanged between them, 

A data link layer (Layer 2) is responsible for moving 
information across a particular link by packaging raw bits 

30 into logically structured packets or frames. Layer 2 

ensures good transmission and correct delivery by checking 
errors, re-transmitting as necessary, and attaching 
appropriate addresses to the data sent across a physical 
medium. If a destination computer does not send an 

35 acknowledgment of frame receipt. Layer 2 resends the frame. 
The contention access methods (e.g., CSMA/CD, and Token 
Passing) are regarded as Layer 2 activities. Layer 2 may 
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be further divided into two sub- layers: Logical Link Control 
(LLC) and Media Access Control (MAC) . The MAC sublayer 
defines procedures the stations must follow to share the 
link and controls access to the transmission link in an 
5 orderly manner. The MAC sublayer defines a hardware or data 
link address called a MAC address. The MAC address is 
unique for each station so that multiple stations can share 
the same medium and still uniquely identify each other. The 
LLC sublayer manages communications between devices over a 

10 single link of the communications network. 

A network layer (Layer 3) is set up to route data from 
one network user to another. Layer 3 is responsible for 
establishing, maintaining, and terminating the network 
connection between two users and for transferring data along 

15 that connection. Layer 3 addresses, messages, and 

determines the route along the network from the source to 
the destination computer. Layer 3 manages traffic, such as 
switching, routing, and controlling the congestion of data 
transmissions . 

20 A transport layer (Layer 4) is responsible for 

providing data transfer between two users at an agreed level 
of quality. When a connection is established, this layer is 
responsible for selecting a particular quality of service 
(QoS) , for monitoring transmissions to ensure the selected 

25 QoS, and for notifying the users if the QoS deteriorates. 
Layer 4 also provides for error recognition and recovery, 
repackaging of long messages into smaller frames of 
information, and acknowledgments of receipt. 

A session layer (Layer 5) focuses on providing services 

30 used to organize communication and synchronize the dialog 
that takes place between users and to manage the data 
exchange. The primary concern of Layer 5 is controlling 
when users can send and receive concurrently or alternately. 
A presentation layer (Layer 6) is responsible for the 

35 presentation of information in a way that is meaningful to 
network users. This may include character code 
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transmission, data conversion, or data compression and 
expansion. 

Layer 6 translates data from both Layer 5 and from 
Layer 7 into an intermediate format and provides data 
5 encryption and compression services. Layer 7 is an 
application layer that provides means for application 
processes to access the system interconnection facilities in 
order to exchange information. This includes services used 
to establish and terminate the connections between users and 

10 to monitor and manage the systems being interconnected, as 
well as the various resources they employ. 

As data is passed down through the layers, each layer 
may or may not add protocol information to the data, for 
example, by encapsulating frames with a header or removing 

15 the header, depending on the direction in the protocol 
stack. The individual protocols define the format of the 
headers . 

MAC address includes a source address and a destination 
address, which have a predefined relationship to a network 

20 station. Higher network layers provide a network address 
that has a logical relationship established by a network 
administrator according to a predetermined network 
addressing arrangement. The assigned network address conveys 
information that can be used by a router when routing frames 

25 through the internetwork. If the network address is 

hierarchical, a router may use a portion of the address to 
route the packet to a higher- level partition or domain in 
the internetwork. Some protocols are hierarchical others 
are not so hierarchical routing may or may not be available. 

30 The global network may be subdivided into IP networks, 

which in turn may be subdivided into subnets. An IP address 
includes a network number (assigned by I ANA) , a subnet 
number (assigned by a network administrator) , and a host 
that identifies an end station. The host number may be 

35 assigned by a network administrator, or may be assigned 

dynamically. This is a form of hierarchical addressing that 
is used by IP routing algorithms to perform hierarchical or 
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prefix routing operations. Routing algorithms maintain 
information of all higher- level routing environments in 
routing tables for domains by recording their shortest 
unique address prefixes. 
5 A station may support more than one network layer 

protocol. Such station has multiple network addresses and 
multiple protocol stacks that present the same MAC address 
on a port for the different protocols. Thus, a multi- 
protocol stack station connected to both an IP and an IPX 
10 network includes an IP network address and an IPX network 
address. 

A communications network may include a number of 
network entities (or nodes) , a number of interconnecting 
links and communication devices. A network node is, for 

15 example, a personal computer, a network printer, file server 
or the like. An interconnecting link is, for example, an 
Ethernet, Token-Ring or other type network link. 
Communication devices include routers, switches, bridges or 
their equivalents. As computer networks have grown in size, 

20 network management systems that facilitate the management of 
network entities, communication links and communication 
devices have become necessary tools for a network 
administrator . 

A bridge or a switch is a Layer 2 entity that is 

25 typically a computer with a plurality of ports for 

establishing connections to other entities. The bridging 
function includes receiving data from a port and 
transferring that data to other ports for receipt by other 
entities. A bridge moves data frames from one port to 

30 another using the end-station MAC address information 

contained in the switched frames. Switches interconnect the 
communication media to form small domains of stations, such 
as a subnetwork. Subnetworks or subnets provide an 
organizational overlay to an internetwork that facilitates 

35 transmission of data between the end stations, particularly 
for broadcast transmissions. The subnet functions to limit 
the proliferation of broadcast frames to stations within a 
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broadcast domain, 

A router is an intermediate station that interconnects 
domains or subnets by providing path from a node on a first 
network to a node on a second network. There are single 
5 protocol or mult i -protocol routers, central or peripheral 
routers, and LAN or WAN routers. A peripheral router 
connects a network to a larger internetwork, and thus may be 
limited to a single protocol. A central router may be 
connected to a different board in a server or a hub and thus 

10 usually has a multi-protocol capability. 

A router provides the path by first determining a route 
and then providing an initial connection for the path. A 
router executes network routing software that depends on the 
used protocol. A router can work with different data-link 

15 layer protocols and thus can connect networks using 

different architectures, for example, Ethernet to Token Ring 
to FDDI . Furthermore, there are routers of several levels, 
wherein, for example, a subnetwork router can communicate 
with a network router. Organizing a communications network 

20 into levels simplifies the routing tasks since a router 

needs to find only the level it must deal with. The use of 
different network levels is shown in Fig. 1. 

In general, a global communications network connects 
devices separated by hundreds of kilometers. A LAN covers a 

25 limited area of maximum several kilometers in radius 

connecting devices in the same building or in a group of 
buildings. LANs usually include bridges or switches 
connecting several end-stations and a server. In a LAN, a 
bridge or a switch broadcasts traffic to all stations. 

3 0 Until a few years ago, a LAN was user-owned (did not run 

over leased lines) with gateways to public or other private 
networks. When a user moved or changed to an end-station at 
another location on the network, a network administrator had 
to rewire and reconfigure the user's station. This has 

35 changed with the introduction of virtual LANs. 

A virtual LAN (VLAN) is a logical Layer 2 broadcast 
domain, which enables a logical segmentation of the network 



WO 01/86844 



PCT/US01/14767 



7 

without changing the physical connections. A VLAN enabled 
switch segments the connected stations into logically 
defined groups. Broadcast traffic from a server or an end- 
stations in a particular VLAN is replicated only on those 
5 ports connected to end- stations belonging to that VLAN. The 
broadcast traffic is blocked from ports with no end-points 
belonging to that VLAN, creating a similar type of broadcast 
containment that routers provide. VLANs may also be defined 
between different domains connected by a router. In this 

10 case, the router passes network traffic from one domain to 
the other (as done without defining a VLAN) , and passes 
network traffic from one VLAN to the other. The router also 
passes network traffic between VLANs that are in the same 
domain because VLANs do not normally share user information. 

15 The router is configured as a member of all VLANs . 

There are several VLAN implementations and several 
methods used to determine VLAN membership. These methods 
include defining the membership by port, by MAC address, by 
protocol, by IP multicast address, by policy, or a 

20 combination of these methods. For port-based VLANs, IEEE 

802. 1Q defines an industry standard implementation. A large 
number of physical ports of a switch may be associated with 
a number of groups within the switch using a VLAN 
arrangement. This arrangement associates a port with a 

25 particular VLAN designation. 

The VLAN designation is assigned (e.g., programmed) to 
each internal port and is stored in a memory portion of the 
switch. Every time a message is received by a switch on an 
internal port, the VLAN designation of that port is 

30 associated with the message. 

Several switches can be interconnected to extend the 
VLAN port associations in the internetwork. This VLAN 
arrangement allows exchange of messages between all ports 
and entities of the network having the same VLAN designation 

35 by associating a VLAN designation with each message. This 
way the entities with the same VLAN designation operate as 
if they are all part of the same LAN. Each VLAN may be 
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further associated with a subnet or broadcast domain so as 
to constrain broadcasting of frame messages to a defined 
subset of stations* 

Virtual Private Networks (VPNs) have been designed to 
5 interconnect end- stations that are geographically dispersed. 
For example, owners of large communications networks can 
provide centralized management services to small and medium 
sized businesses. The provider can configure VPNs that 
interconnect various customer sites in geographically 
10 separate locations. These VPNs offer privacy and cost 
efficiency through sharing of network infrastructure. 
Various VPNs have been proposed with various degrees of 
security, privacy, scalability, ease of deployment and 
manageabi 1 i ty . 

15 A global communications network may use a different 

levels different routing and connection management protocols 
such as International Standards Organization (ISO) Open 
Systems Interface (OSI) Intermediate Systems to Intermediate 
Systems (IS- IS) , and Internet Open Shortest Path First 

20 (OSPF) protocols are used for connectionless routing of data 
frames. Asynchronous Transfer Mode (ATM) Forum Private 
Network-Network- Interface (PNNI) protocol is used for 
connection oriented multi-media services. The routing 
protocols identify a network node using a global address of 

25 a Route Server Element (RSE) . The RSEs generate routing 
that identify optimal routes for communication throughout 
the network. The RSE is responsible for administration of 
the algorithms that enable a node to keep its view of the 
network topology and performance metric current, referred to 

30 as Routing Information Exchange (RIE) . Thus an RSE usually 
acts as a central element for the routing of traffic through 
the node. 

The Internet Protocol (IP) divides networks into 
autonomous systems (AS) with routing areas and area sub- 
35 nets. The area sub-nets form adjacencies and flooding of 
link state information. The OSPF link state protocol 
provides for dynamically auto -configurating the AS topology 



WO 01/86844 



PCT/US01/14767 



and for forwarding data frames received from adjacent nodes. 

The link states are propagated depending on the location 
and designation of the routers which maintain routing tables 
and forward data frames. The protocol defines four types of 
5 routers, that is, an internal, border, backbone and boundary 
router. The border, backbone and boundary routers advertise 
summary route information and internal routers advertise 
complete link state information within an area. An optimal 
routing policy such as SPF can be used for routing within an 

10 AS. Each router node therefore maintains a topology 

database, a routing table and a forwarding capability. A 
Type of Service (TOS) field in the IP packets may be used to 
create routes based on QOS requirements such as delay, 
throughput and reliability. All OSPF protocol exchanges are 

15 authenticated. 

The OSI Intra-Domain Routing Information Exchange 
protocol provides for routing domains composed of areas 
within which detailed routing information is maintained 
within individual network elements called Intermediate 

20 Systems (IS) . The IS can dynamically auto- configure the 

area topology and forward protocol data units (PDU) received 
from adjacent nodes (i.e., neighbors). Each node develops a 
routing table, which allows PDUs to be forwarded based on 
some optimal policy (e.g., the SPF algorithm) . An IS uses a 

25 "flooding" scheme to exchange authenticated link states 
throughout an area. In a flooding scheme, when an IS 
receives a PDU it automatically copies the PDU to all 
adjacent nodes. This way the PDU propagates throughout the 
network. Link states carry addressing and quality of 

30 service (QOS) information. Routing between areas is 

accomplished using Level 2 intermediate system. Level 2 
intermediate systems use area addresses to route a data 
frame according to SPF (or another policy) to the nearest 
Level 2 intermediate system, which in turn forwards the 

35 frame within the destination routing area. 

ATM Forum PNNI protocols describe both routing and 
signaling in an integral scheme carrying of both connection- 
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oriented and connectionless traffic. PNNI accommodates 
large networks by forming multiple levels of hierarchy. At 
the lowest level, PNNI uses peer groups that are a 
collection of logical nodes that exchange information with 
5 other members of the group. Peer groups communicate by the 
use of Hello-protocols and flooding of link states. Logical 
peer groups are then formed to communicate between peer 
groups. This organization is accomplished by selecting peer 
group and logical peer group leaders which form overlay 

10 networks for the distribution of summary routing information 
at different levels . Members of the same peer group maintain 
an identical view of the PG, and nodes outside of the PG 
• exchange messages with the PG as a whole. PNNI also 
provides for signaling to establish routes between a source 

15 and a destination. The created routes may be based on QoS 
criteria as defined by User Network Interface (UNI) traffic 
management specifications. The QoS criteria include 
parameters such as available bit rate (ABR) , maximum 
supported bit rate (MBR) , likely variability in end-to-end 

20 delay for ATM cells, and cell loss ratio with cell loss 

priority. The QoS routing is required for applications with 
real-time requirements and scalability to large global 
networks. PNNI makes no assumption about an optimal routing 
algorithm, such as SPF. Optimal routing is left to 

25 Implementation. The protocol is capable of responding to 
changes in traffic patterns in the form of dynamic routing 
of traffic. These features and the reachability of 
information, also inherent in the routing database, enables 
efficient routing decisions for carrying connection-oriented 

30 and connectionless traffic. PNNI provides large-scale 

networks with a single routing protocol, unlike the Internet 
strategy of using a number of protocols at various levels. 

In general, the use of WANs, LANs, VPNs, and VLANs has 
increased the number and complexity of communications 

35 networks. These networks continuously evolve and change due 
to growth and introduction of new interconnections, 
topologies, protocols, or applications. Furthermore, most 
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networks have redundant communication paths to prevent 
portions of the network from being isolated due to link 
failures. Also, multiple paths can be used simultaneously to 
load-balance data between the paths. However, redundant 
5 paths can also introduce problems such as formation of 

loops. Furthermore, network performance can degrade due to 
improper network configurations, inefficient or incorrect 
routing, redundant network traffic or other problems. 
Network hardware and software systems may also contain 

10 design flaws that affect network performance or limit access 
by users to certain of the resources on the network. These 
factors make network management complex and difficult. 

A network management process controls and optimizes the 
efficiency and productivity of a communications network. A 

15 network management station manages the network entities 

(e.g., routers bridges switches, servers, storage devices, 
computers, printers) using a network management protocol 
such as a Simple Network Management Protocol (SNMP) , 
Internet Control Message Protocol (ICMP) , or another network 

20 management protocol known in the art. Using a network 
management protocol, the network management station can 
deliver information or receive information by actively 
polling the network entities or by receiving unsolicited 
information from the network entities. Using SNMP, a 

25 network management station can executes a set, get, or get- 
next functions to sett and retrieve information from a 
network entity. This information may be stored within the 
polled network entity as Management Information Base (MIB) . 
The network management station can receive unsolicited 

30 information from a network entity in the form of an SNMP 
trap. Network entities may send SNMP traps to the network 
management station when a problem in the network or network 
entity occurs. 

A network management station may be implemented using 

35 any general purpose computer system, which is programmable 
using a high-level computer programming language or using 
specially programmed, special purpose hardware. The hardware 
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includes a processor executing an operating system providing 
a platform for computer programs that run scheduling, 
debugging, input-output control, accounting compilation, 
storage assignment, data management, memory management, and 
5 communication control and other services. The application 
programs are written in high level programming languages . 

A network management station can include a network 
manager unit, a network communication interface, a data 
acquisition unit, a data correlation unit, and a graphical 

10 user interface. The data correlation unit interprets data 
received through the data acquisition unit and presents the 
interpreted data to a user on the graphical user interface. 

The network communication interface may include 
transport protocols and LAN drivers used to communicate 

15 information to the communications network. The transport 
protocols may be IPX, TCP/IP or other well-known transport 
protocols. The LAN drivers may include software required to 
transmit data on a communications network through the 
network interface. The LAN drivers are generally provided 

20 by the manufacturer of the network interface for a general 
purpose computer for the purpose of communicating through 
the network interface. The network manager unit may be an 
SNMP network manager/agent implementing SNMP functions, or 
another type of network manager unit performing associated 

25 management functions. The network manager unit utilizes the 
network communication interface to transfer requests to 
network entities over a communications network. 

A network management station may use a network 
management agent residing on a network entity. The network 

30 management agent may be a software process running on a 

processor or may be special purpose hardware . The network 
management agent may be an SNMP agent (or I CMP agent?), 
which may include a data collection unit, a network manager 
unit, and a network communication interface for 

35 communication as described above. For example, this 

communication may use network management functions such as 
SNMP functions. Alternatively, a network management agent, 
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residing on a network entity, may include a data correlation 
unit, a data collection unit, a network manager unit and a 
network communication interface for communication, 

5 Summary of the Invention 

The present invention is a system, a method and a 
product (that can be stored in a computer- readable storage 
medium) for determining topology of a communications network 
or determining topology of a part of the network. The 
10 system and method provide a modular framework that can be 
integrated with a fault diagnoser, an impact analyzer, a 
help desk system or any other module of a network management 
system . 

According to one aspect, a system and method for 

15 determining topology of a part of a communications network 
include a topology mapper. The topology mapper includes 
discovery module, a memory, and a presentation module. The 
discovery module is constructed and arranged to discover 
network elements in a communications network. The memory 

20 constructed and arranged to store topology data received 
from the discovery module. The presentation module 
constructed and arranged to present data related to topology 
of the communication network. 

According to another aspect, network mapping means 

25 includes means for discovering network elements in a 

communications network, means for storing topology data 
received from the discovery module, and means for presenting 
data related to topology of the communication network. 

Preferred embodiments of these aspect may include one 

30 or more of the following: 

The discovery module is further constructed to 
determine logical dependencies between network layers of the 
communications network. The discovery module is further 
constructed and arranged to determine dependencies between 

35 an application and a system in the communication network. 
The discovery module is further constructed to determine 
topological change in the network element. 
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The topology mapper may include a data log constructed 
and arranged to store information related over time to the 
function of the discovery module. The presentation module 
is constructed and arranged to display historical topology 
5 data and current topology data. The presentation module is 
constructed and arranged to display current topology data 
and intended topology data. The presentation module is 
constructed and arranged to display the topology data in a 
mult i- layer format. 

10 The topology mapper is constructed and arranged to 

display interdependencies among elements of two of the 
network layers. The topology mapper is constructed and 
arranged to display topological change for the purposes of 
historical reconstruction . 

15 According to yet another aspect, a network management 

system including a topology mapper. The topology mapper 
includes discovery module, a memory, and a presentation 
module. The discovery module is constructed and arranged to 
discover network elements in a communications network. The 

20 memory constructed and arranged to store topology data 

received from the discovery module. The presentation module 
constructed and arranged to present data related to topology 
of the communication network. 

Advantageously, one or several embodiments of the 

25 present system and method reduce impact on the network 
during periods of topology determination and maintenance. 
They provide for logging of elements of topological change 
for purposes of "scenes 1 recreation during phases of 
historical and intermittent fault diagnosis. These 

30 embodiments enable robust handling of unmodeled elements of 
a topology and correct association within the topological 
landscape such that a topologically centric fault diagnosis 
system can accurately and effectively perform a root cause 
analysis . 

35 Furthermore, some embodiments of the present system and 

method enable the use of other forms of empirical and 
inferred network data results in a complete and accurate 
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representation of the topology. They capture logical 
dependencies between network layers and those between 
application and systems with the network- This allows a 
topologically centric fault diagnosis system to accurately 
5 and effectively perform its designed task. The modeling of 
intent gives the user of a topologically centric fault 
diagnosis system a process for dealing with network 
redundancy and topological change. 

Preferably, some embodiments of the present system and 
10 method enable maintenance of a topology that is historical 
in nature, maintenance of a topology that captures design 
and management intentions, and maintenance of three separate 
topologies, i.e., a current, historical and design intent 
topology. 

15 Preferably, some embodiments of the present system and 

method also enable maintenance of a rich multi- layer 
topology that incorporates multiple network layers within a 
single topological model; and representation of 
interdependencies amongst elements of different network 

20 layers. Some embodiments of the present system and method 
also enable presentation of the concept of separation of 

i 

application, system and network "space 1 ; mapping of 
interdependencies between applications, systems and network 
entities. These embodiments use the concept of handling 

25 topology with compartmentalization. 

Preferably, some embodiments of the present system and 
method also enable reduction of the network traffic impact 
through utilization of tactical monitoring for topology 
change; enable data gathering by multiple means, 

30 specifically by knowledge of known support by topological 
entity in question; enable both logging topological changes 
for the purpose of historical reconstruction and logging 
type of topological change with specialized records; and 
enable finding proper location within the modeled topology 

35 of a previously unmodeled, but addressable entity. 

Brief Description of the Drawings 
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Fig. 1 shows diagrammatically several network 
management modules connectable to a communications network. 

Pigs. 2 and 2A show diagrammatically hierarchical 
architecture of the communications network of Fig. 1. 
5 Figs. 3 and 3A show a high level flow diagram of a 

topology mapping process. 

Figs. 4, 4A r and 4B are flow diagrams that 
illustrate an algorithm for modeling a border gateway 
protocol (BGP-4) network. 
10 Figs. 5 and 5A are flow diagrams that illustrate an 

algorithm for finding Routing Information Protocol (RIP) 
network. 

Figs. 6 and 6A are block diagrams of a routed 
network configuration of an OSPF network. 
15 Figs. 6B through 6K are flow diagrams that 

illustrate an algorithm for finding an OSPF network such as 
the network shown in Fig. 6. 

Fig. 7 illustrates diagrammatically a semantic 
Layer-2 topology model . 
20 Figs. 8 and 8A show diagrammatically a rich multi- 

layer topological model of a portion of the communications 
network of Fig. 1. 

Fig. 8B shows diagrammatically an intended and 
^historical relationship between three modeled elements. 
25 Figs. 9, 9A, and 9B are flow diagrams that 

illustrate an algorithm for finding a change in network 
topology, for example, after displaying the network in a 
model shown in Figs. 8, 8A # and 8B. 

Fig. 9C is a flow diagram that illustrates an 
30 algorithm for validating network topology. 

Fig. 10A and 10B are flow diagrams that illustrate 
an algorithm for finding a path to or from an unmodeled 
network entity in a modeled network region. 

Fig. 11 is a semantic data model used when constructing 
35 the rich multi-layer topological model of Figs. 8 and 8A. 



Description of Preferred Embodiments 
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Fig. 1 shows diagrammatically a network management 
system 10 including a fault diagnosis engine 12, a topology 
mapper 14, an impact analyzer 16 and a help desk system 18. 
The network management system communicates with a 
5 communications network 20 (or application service) . The 
network includes a set of interconnected network elements 
such as routers, bridges, switches, and repeaters. These 
network elements provide transportation of data between end 
stations. Furthermore, there are computers known as servers 

10 that provide services such as e-mail, accounting software, 
sales tools, etc. Typically, data is transmitted 
electronically or optically, and network elements can 
forward data in packets, frames or cells to the intended 
destination. Servers include network adapters and/or 

15 software that interpret the electronic or optical data 

packet into the data elements and pass these elements to the 
appropriate application being hosted. 

The network management system 10 includes a 
commercially available processor (for example, Pentium 

20 microprocessor manufactured by Intel Corporation) executing 
an operating system providing an operating environment for a 
network management program. The processor and the operating 
system provide a computer platform for which application 
programs are written in higher level programming languages . 

25 The computer (or application host) interfaces with 

permanent data storage, such as a magnetic or optical disk 
drive, a disk array, non-volatile RAM disk, or a storage 
area network, which maintain data files such as user 
configurations and policies. In general, the network 

30 management program may be configured as a generic software 
application residing in any commercially available computing 
platform. 

Preferably, fault diagnosis engine 12, topology mapper 
14, and help desk system 18 are software applications 
35 written iri Java and running on any computer with a Java 
Runtime Environment (JRE) . For example, a Dell laptop 
computer with an Intel Pentium processor running the Windows 
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2000 operating system, or a Sun Ultra 60 computer running 
Solaris v. 2.7. Alternatively, fault diagnosis engine 12, 
topology mapper 14, and help desk system 18 are developed in 
any object oriented or structured programming language, and 
5 compiled for execution on any one of many computer 
platforms, or could be implemented on a neural network 
computing device. 

The computer has a network adaptor that provides 
communication (preferably, but not necessarily, IP) to the 

10 users on the network. The fault diagnosis engine 

application may share a host with help desk system, and/or 
the topology mapper, or each can run on a separate host, in 
which case they communicate using a network adaptor. 
Topology mapper 14 determines the network topology and 

15 creates a model . The permanent data storage holds data 
files that describe the current network topology, and 
configuration files that control the performance of topology 
mapper 14. A user is an end station, interfaced to access 
the network or services, used by a person who is using the 

20 network, or is using services provided by the network. 

The topology mapper 14 services topology requests 
from a fault diagnosis engine 12. In order to determine the 
root cause of reported communication faults, the fault 
diagnosis engine needs certain topology information, either 

25 current or at some point in history. For example, the fault 
diagnosis engine may request the Layer- 3 data path from one 
network element to another, as the network existed at a 
certain time yesterday afternoon. Additionally, the fault 
diagnosis engine may request information about the intent of 

30 the topology. By maintaining a versioned set of connection 
models that represents network changes and network intention 
(Fig. 8B) , the invention can provide this needed 
information. 

Figs. 2 and 2A show diagrammatically a hierarchical 
35 architecture of the communications network shown in Fig.l. 
The individual layers are physical and logical abstractions. 
A typical global network 20 (the Internet) is a collection 
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of network devices collaborating together to transmit data. 

The network devices are logically organized and grouped to 
allow standard network data forwarding protocols to be used 
to determine data paths through the network. Additionally, 
5 certain devices are deemed "border 11 devices since they 
transmit data from one logical grouping to another. 
Referring to global network 20 in Fig. 2, a person sitting at 
a PC uses a web browser application 24 within an autonomous 
system (AS) 22. Application 24 requests a web page from a 

10 Yahoo I web server 39 located in an AS 38. Modeling of the 
actual network provides a very rich multi-layer, accurate 
representation of the network, which a fault diagnosis 
process can use to diagnose faults. 

Internet service providers and telecommunications 

15 companies like AT&T, MCI, and Sprint provide and maintain 
autonomous systems 22,30,34,38, communicating by network 
routers 28,32 and 36. Routers 28,32 and 36 are organized 
into logical administrative groupings . The most common 
protocol used at this level is the BGP (Border Gateway 

20 Protocol) . Routers 28,32 and 36 use an External BGP protocol 
to organize router inf ormation, and are located at borders 
of the Autonomous Systems. 

At the AS level 30, network routers 44 and 47 are again 
administratively organized into logical groupings called 

25 confederations. Routers 44 and 47 use an Internal BGP 

protocol to organize route information and form the borders 
of the confederations 40,42, and 48. If all the routers are 
using the same interior gateway protocol and are centrally 
located and administered, the AS 30 may not have distinct 

30 confederations 40,42 and 48, but may function as both a 
confederation and an AS. 

At the confederation level 40, one of many specific 
interior gateway routing protocols is used to determine 
routing information for data packets. The use of a given 

35 interior gateway routing protocol is confined to a single 
routing domain 50. The routing protocols include, but are 
not limited to, OSPP, RIP, and IS- IS. At routing domain 
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level 50, border routers 56 and 58 are called Inter-Domain 
Routers since they have the ability to transmit data between 
distinct routing domains even if the domains use different 
interior gateway routing protocols. Within a particular 
5 interior gateway protocol, border routers may be treated as 
if they were AS border routers. 

Within routing domain 50, a single interior gateway 
routing protocol is used. If the domain is large enough and 
the selected protocol supports sub-domains, routing domain 

10 50 may be further broken up into sub-domains 60,62 and 64. 
OSPP may be used as the interior gateway protocol inside 
routing domain 50. OSPF allows network administrators to 
organize a network into yet-smaller administrative 
■groupings, i.e., sub-domains 60, 62 and 64, called Areas. 

15 Border routers 63 and 65 at this level are called Area 

Border Routers, and use the OSPF protocol to determine how 
to transmit data between OSPF Areas 60, 62 and 64. 

Inside an OSPF Area, interior routers use the OSPF 
protocol to determine the data paths. This is the final 

20 logical grouping, which uses routers as its data transport 
mechanism. IP-based subnets 70,72 and 74 are typically 
defined as the sub-regions inside an OSPF Area. IP subnets 
70,72 and 74 define a common IP addressing space for devices 
inside the subnet. 

25 At the subnet level 70, data is transmitted from router 

to router via Layer-2 bridges and switches. Typically, the 
subnet topology is highly redundant, with numerous physical 
connections between the bridges and switches. Layer-2 
protocol (usually Spanning Tree) is used to determine the 

3 0 single path data will travel between bridges. Only one path 
will exist between any two devices inside a Subnet in which 
Spanning Tree is used. 

Topology mapper 14 runs topology mapper algorithms 
for discovering and modeling of a single aspect (or set of 

35 layers) of the generic model. As the characteristics or 
capabilities of each network device become known to mapper 
14, an appropriate algorithm is executed. A BGP algorithm 
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- described in connection with Fig. 4, attempts to discover 
and model the network at global network level 20, autonomous 
system level 30, and confederation levels 40. OSPF and RIP 
algorithms, described in connection with Figs. 5 and 6, 
5 discover and model the network at the Routing Domain, and 
Sub Domain levels. 

Figs. 3 and 3A show a high level flow diagram of a 
topology mapping process. A process 100 is initiated when a 
user provides or modifies a set of IP address ranges and/or 

10 a set of "seed 11 router IP addresses. A ""seed" router is 
merely a router that is used to initiate discovery. Each IP 
address in the ranges is sent SNMP commands requesting 
sysOID and sysDescr objects in the MIB-II System Group 
(described in technical standard RFC1213) . In step 102, an 

15 entity is "nominated 1 1 for placement within topology by a 
""ping sweep 1 ' of an IP address range, or by discovering 
peers and connections of a (previously nominated) adjacent 
entity. Step 102 identifies entity type by reading sysOID 
and sysDesor object via SNMP protocol. Step 104 creates an 

20 object of proper type to represent the entity. In steps 110 
to 114, the type of model created depends on the sysOID and 
sysDescr. In this way, a specific model type is created for 
a Cisco 7500 router, or a Nortel 5500 switch, for example. 

If an entity does not have sufficient MIB support in 

25 steps 140 through 150, a telnet script is executed to mine 
the necessary information from the entity. Alternatively, a 
combination of SNMP gets and telnet scripts can be used to 
accomplish this task. It is certainly possible to use CMIP 
or HTTP or other protocols to mine the information. 

30 In response to the ""discover network protocols 1 1 

message (step 108) , the entity model will return a set of 
protocol models (step 115) . Each protocol is sent the 
""initiate discovery 1 1 message (step 112). Each protocol 
type responds uniquely to the ""initiate discovery 1 ' 

35 message. This message actually initiates various protocol 
discovery algorithms (stepll6) , and those algorithms will 
nominate other entities for discovery. That is, if a BGP4 
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protocol model is sent the ""initiate discovery 11 message/ 
the BGP4 modeling algorithm is initiated. Any BGP4 peers of 
this entity that are not already modeled will be nominated 
for discovery. 

5 Fig. 3A illustrates network protocol discovery 120. 

The object-oriented methodology allows objects of different 
types to respond to messages differently which is known as 
polymorphism. The algorithm executes steps 122 through 136 
if the entity type has MIBs that provide needed information. 

10 For each supported network protocol, in step 126, the 

algorithm looks up the OID that defines management for this 
protocol. The algorithm creates a protocol model (step 132) 
and associates it with entity model via the HasCapability 
relation (step 134) . Referring to steps 140 through 150, a 

15 scripting language called ""Expect 11 is used to create 

scripts to automate telnet sessions with network elements 
and parse out text elements. The interesting information 
parsed is stored in a simple file (either ASCII or XML) to 
create a protocol model (step 148) . 

20 Figs. 4, 4A and 4B are flow diagrams that illustrate 

an algorithm for modeling a BGP-4 network as part of a rich, 
topology model. This algorithm can provide topology and 
data-path information to fault diagnosis engine 12 (Fig. 1) . 
The algorithm is initiated when a router with BGP peers is 

25 nominated for discovery. The fact that the router has BGP 
peers is determined either by reading a standard (technical 
standard RFC1657) , proprietary MIB, or by parsing an 
automated telnet session with the router. A BGP router 
maintains a connection with each peer (as defined by 

30 technical standard RFC1771) . A connection with a peer in a 
different AS is known as an external link, or EBGP link 
(Fig. 2) . EBGP links are significant because they indicate 
which regions (i.e., autonomous systems 40,42 and 48) are 
adjacent to the region being examined, and which entities in 

35 this region are region border entities. 

This algorithm can also map a graph of clusters 
interconnected by route reflectors (RR) and other non-client 
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routers if any such cluster exists in AS 22, 30,34 or 38. 
The AS may be a sub-AS within BGP confederation 40, 42 or 
48. If multiple AS's are returned, the topology represents a 
BGP confederation. Otherwise, the topology is a single 
5 independent AS. The algorithm uses two defined sets for AS 
as follows: Set I contains a list of AS f s we are about to 
examine; and Set II contains a list of AS's we have visited. 

Furthermore, the algorithm defines Set III to contain the 
list of EBGP sessions as the result of the algorithm. For 

10 each AS, the algorithm uses three sets as follows: Set A. 
contains a list of nodes we need to examine; Set B contains 
a list of nodes we have visited; and Set C contains a list 
of EBGP links we have found in the AS. The algorithm also 
produces a list of clusters found in the topology. 

15 The algorithm provides Set III with a list of EBGP 

links in the topology; Set B of each BGPASManager with a 
list of nodes in the AS; and Set C of each BGPASManager with 
a list of BGP border nodes in the AS. 

Referring to Fig. 4, flow diagram 140 is executed by a 

20 BGP Domain Manager, which initializes the set of AS f s to 
examine. The algorithm also initiates the BGP AS Manager 
with an AS and a BGP node (step 164) . Based on the results 
of the AS Manager, the AS is qualified as either part of a 
confederation (step 178) or an independent AS (stepl76) . 

25 Also, as side effect of calling the AS Manager, in steps 166 
through 170, the set of AS's to examine (set I) is updated 
based on external links discovered. This process will 
continue examining AS 1 s until the set of AS's yet to examine 
is empty. 

30 Referring to Figs. 4A and 4B, a BGPAS manager process 

180 runs on a BGP node in a particular AS. The list of 
peers is retrieved from the device (typically via SNMP) . 
The external links are added to Set C. Adjacent AS's are 
added to Set I (step 222) . The internal peers are added to 

35 the set of nodes to examine (set A) . If the node has as 

least three internal peers (step 190) , and each peer has at 
least three peers, the algorithm identifies the node as part 
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of a cluster (step 196) . 

In step 182 for a BGP node, the algorithm adds the node 
to Set A and finds all BGP peers of the node (step 184) . If 
there is only one IBGP peer and cluster already exists for 
5 the peer, the algorithm adds the node to the cluster list 
(step 194) . If there are only two IBGP peers and each of the 
peers has at least 3 IBGP connections and cluster already 
exists for the peer(s), the algorithm adds node to the 
cluster list. Otherwise, it creates a cluster with the 

10 peer's RID(s) (step 196) . 

In step 200, if not present in Set C, the algorithm 
adds the EBGP peers to Set C. Step 222 adds the external 
peer's AS to Set 1 if the AS is not in Set II. Step 224 
moves the node to Set B. In step 225, for each of its IBGP 

15 peers, if it is not in Set B, the algorithm adds it to the 
end of list A. If list A is empty, the algorithm moves the 
AS from Set I to Set II (step 230) then the algorithm copies 
each BGP pair in Set C to Set III if it is not in Set III 
(step 232) . 

20 Figs. 5 and 5A are flow diagrams that illustrate an 

algorithm for finding a RIP network. This algorithm is 
initiated when a router running RIP is nominated for 
discovery. The result of running the algorithm is that the 
connectivity between all RIP nodes in the region are 

25 discovered. In flow chart 240, the algorithm creates " V RIP 
Peer 11 connections that relate the appropriate interface 
models of the RIP peer models. Another way of representing 
the results would be maintaining a list of neighbors and 
interfaces for each RIP node. 

30 Referring to Fig. 5, the algorithm maintains a list 

of candidates for RIP modeling. Initially, this list has a 
single entry - that is the first router found with RIP peers 
(step 242) . Each candidate in the list is processed to 
determine its peers. Peer determination is done by simply 

35 reading an SNMP table (step 254); generally the SNMP table 
defined by technical standard RFC1724 . For each peer, the 
interface that routes to that peer is determined in steps 
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256 through 270. The interface of that peer that routes to 
this node is determined in a similar way. A logical 
connection, specifically, a ""RIP Peer 11 is created, 
associating the two interface models (steps 264 and 266) . 
5 The peer is then added to the list of candidates for 

discovery, if it hasn't been processed already. Once the 
list of candidates is empty, the modeling is done. 

Steps 281 through 292 are used for finding the 
interface that is routing to a peer. This is done by 
10 figuring out which interface is routing to the peer by 

^indexing into 11 the route table, (either the ipCidrTable 
or ipRouteTable) , wherein steps 281-292 are done on the 
ipCidrTable . 

Alternatively, to find the interface that routes to 
15 RIP peers, it is also possible to read the ipAddrTable (as 
defined by technical standard RFC1213) . This table contains 
the ipAddresses and network masks for all the local subnets, 
and a reference to which interface is attached to each 
subnet. Because RIP is a broadcast -based protocol all 
20 peers will be attached to local subnets. The algorithm of 
steps 281-292, uses ipCidrTable which is more flexible in 
that it can find the interface that routes to subnets that 
are not adjacent to the node being examined. This process, 
however, could take as many as 32 reads to complete for each 
25 peer. Using the ipAddrTable algorithm has an advantage in 
that, practically, routers generally have less than 32 IP 
addresses, and the entire ipAddrTable could be read once, 
and cached for future uses (or finding other peer 
interfaces) . 

30 Figs. 6 and 6A are block diagrams of a routed network 

configuration of an OSPF network. Fig. 6 illustrates AS 1 
with a backbone area 300 (Area 0.0.0.0) and routing Areas 
0.0.0.1; 0.0.0.2; 0.0.0.3; and 0.0.0.5. Backbone area 300 
(Area 0.0.0.0) includes an external border router 302 

35 (running EGBP) and Area Border Routers (ABR) 304, 306 and 
308. Furthermore, Fig. 6 illustrates routers within each 
sub-domain 310, 320, 330, and 340, such as routers 316, 314, 
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& 348 in area 0.0.0.1. 

Fig. 6A illustrates another AS 2 connected by EGBP 
router 352 to EGBP router 302 (Fig. 6) . To find the above- 
mentioned routers, the system executes an GSPF topology 
5 search algorithm. The OSPF topology search algorithm finds 
the OSPF routing backbone 300 and a list of directly and 
indirectly attached areas in the routing domain. The 
algorithm identifies ABRs 304, 306 and 308 used in the 
routing hierarchy. The algorithm can also map a graph of 
10 adjacent routers connected by Designated Routers (DR) and 
Backup Designated Routers (BDR) . If only a single OSPF area 
exists, the backbone area is not necessary. The algorithm 
considers the following factors: 

(1) An area with fewer numbers of internal nodes usually 
15 has fewer ABRs . Hence a large area with a large number of 

ABRs require more concurrent threads to map its topology 
efficiently. 

(2) Avoid reading the link state database of an ABR to 
retrieve topological information. However, it is feasible to 

20 read the same database from an internal node. 

(3) Difference between the OSPF backbone area and a non- 
backbone area. Specifically, a backbone area consists of 
relatively more ABRs than a regular area, and it may not 
have any internal nodes. This calls for different area 

25 mapping strategies for the backbone and non-backbone areas. 

(4) Minimizing SNMP related management traffic. 

In the diagram of Fig. 6, area 0,0.0.2 is a stub area 
which uses default routes to route packets bound for inter- 

30 area and OSPF external destinations. Area 0.0.0.1 is a 

regular area where routers 348, 314 and 316 can accept OSPF 
advertisements regarding both inter-area and external 
destinations. Thus, routers 348, 314 and 316 can optimize 
routes to these destinations. Autonomous System Border 

35 Routers (ASBRs) allow routing information to be exchanged 
between the OSPF domain and other routing domains. Area 
Border Routers (ABRs) 304, 306 and 308 connect non-backbone 
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areas to the OSPF backbone. Routers internal to an area are 
Internal Routers (IRs) such as router Cisco 2500 (Fig. 6) . 
Internal Routers do not keep topology information about 
other areas. OSPF allows virtual links to be set up between 
5 two ABRs . 

The OSPF Domain Manager is responsible for discovering 
OSPF Networks. As shown in Fig. 6B, a single OSPF Domain 
object is created for each OSPF domain that needs to be 
discovered. Each OSPF Domain needs a single OSPF Node to 

10 seed the discovery process. The domain manager has a queue 
that contains OSPF Nodes that need to be processed and has a 
number of threads to perform OSPF Node discovery with (Fig. 
6C) . Each thread will pull one node off the queue and 
discover and process that node. During node processing, 

15 additional nodes will be placed on the queue. Once all the 
nodes have been discovered & processed, the queue will be 
empty. At this point, all the active devices have been 
discovered. The AS borders within this set must now be 
identified. Next the passive network segments need to be 

20 discovered and modeled. This is done by creating an Area 

Manager for each discovered Area, selecting a representative 
node, and reading tables to find network segments, summary 
routes, and external routes that area advertised. 

Fig. 6C shows per thread processing to determine 

25 OSPFNodes (i.e., OSPFNode discovery). In algorithm 405, 

each thread will take a single node off the queue, read the 
neighbor table, process this inf ormation, and add any new 
neighbors to the queue to be discovered. The thread will 
then process other tables explained below. Once the last 

30 node has finished processing, control returns to the main 
algorithm thread. 

Referring to Fig. 6D, in algorithm 410 OSPF Node 
processing is responsible for creating the model of the node 
and populating the node's attributes. During processing, 

35 models for OSPF Areas that the node is a member of will also 
be created. Virtual Links that the node may participate in 
are also discovered if the node is an ABR. Interfaces and 
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adjacencies will be processed and modeled as well. 

Referring to Fig. 6E, in algorithm 420, each interface 
is examined for its area membership. The area ID is used to 
find any existing Area Managers for that area. If none are 
5 found, then a new Area Manager is created. The type of area 
is then determined by checking the flags in the tables. The 
flags indicate whether NSSA is selected, Stub is selected, 
or if both are off that the area is a normal area. All nodes 
that are members of the Area will have identical settings 
10 for the Area, so this is done only when the area is first 
encountered. 

Referring to Fig. 6F, virtual link algorithm 430 
determines, for any ABR, if there are any Virtual Links 
configured. This is done by reading the virtlf Table. For 

15 each virtual neighbor that is encountered, we create an 
object that will represent the connection. Referring to 
Fig. 6G, algorithm 440 initializes OSPF node adjacency list. 
While reading the neighbor table, only those neighbors that 
have progressed to "full" or "two-way" are included in the 

20 adjacency list. 

Referring to Fig. 6H, algorithm 450 initializes 
interfaces and clusters. For each interface in the 
ospf If Table, a model is created. This model is populated 
with information from the table. If the type is either 

25 broadcast or multiple access, then a cluster should be 
created to model the relationship between all the other 
interfaces heard off the network and tag the designated and 
backup routers. 

Referring to Fig. 61, algorithm 460 initializes the 

30 ASBRNodes. For each ASBR, the external networks and routes 
it advertises need to be discovered and modeled. Reading the 
extLsdb table does this. Then the OSPFDomain Manager creates 
objects to represent the networks. 

Algorithm 470, shown in Fig. 6K, finds network segments 

35 within an area. For each network Link State Advertisement in 
the LSDB, create a network model. (Instead of reading the 
LSDB from a representative router from the area, another 
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approach is to read all the interface tables from all 
routers in an area.) After finding network segments in an 
area, referring to Fig. 6K, algorithm 480 finds external 
summary routes. For the representative node, the process 
5 reads the area aggregate table. This table contains the 
external summary routes that are propagated to the network. 
A network model must be created for each one. 

Fig. 7 illustrates diagrammatically a semantic 
Layer-2 topology model. This model is the ^"blueprint" or 
10 ^"template" containing the rules for creating a Layer-2 

topology model. This model is created and maintained by the 
Layer 2 discovery algorithm to be described in the following 
paragraphs . 

Referring again to Fig. 2A, IP subnet 70 was 

15 discovered during a layer 3 analysis. The following Layer-2 
data model description and discovery algorithm determines 
Layer-2 topology information contained within given Subnet 
70. Futhermore, an algorithm described in U.S. Patent 
5,727,157 may be used to discover the connection to Internal 

20 Routers. The Layer-2 discovery algorithm employs object- 
oriented design methodologies to perform the required work. 

The SpanTreeApp model 522 represents the ability 
of device 510 to run and participate in the Spanning 
Tree Protocol . The ability is modeled by creating an 

25 AppRunsOn relationship between the device 510 and the 
SpanTreeApp 522. When the app model is created during 
the ^discover network protocols' 1 process (described 
in Fig. 3A) , the object will read the spanning tree MIB 
tables of the device by sending the 

30 READ_SPANNING_TREE_TABLE message to itself. The 

message is used so it can be overridden by proprietary, 
vendor-specific apps. The; object will then store the 
data. 

The object will then attempt to create a new 
35 SpanTreeRegion model 532 and add the device model 510 
as a member via the RegionHasMember relationship. It 
will do this by first finding all of the Subnets of 
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which the device is a member, then creating a new 
SpanTreeRegion model 532 inside each Subnet if none 
already exists. If one already exists, it will use 
that one. If the device is not yet a member of a 
5 Subnet (because some layer 3 discovery has not yet 
run) , then it will watch the HasMember relation, and 
will continue when it becomes a member of a Subnet. 
When the device has been added as a member of a 
SpanTreeRegion 532, the object will then send the 

10 CORRELATE_REG I ON message to the SpanTreeRegion 532 to 
tell it to re-discover the topology (SpanTreeLinks) in 
that region. 

This object is also registered for the 
DISCOVERJTABLES message. When the message is received, 

15 the object will re-read the STP tables (using the 
READ_SPANNIWG_TREE_TABLE message) and store the new 
data. 

This object also registers for the 
GET_SPANNING_TREE_DATA message. When the message is 
20 received, the object returns the spanning tree MIB data 
stored. If none is stored yet, it will send the 
READ_SPANNING_TREE_TABLE message to get it from the 
device . 

This object registers for the 
25 READ_SPANNING_TREE_TABLE message and reads the dotldStp 
table when triggered. Other proprietary objects 
derived from this object override this message and get 
spanning tree information from another vendor- specif ic 
source . 

30 The SpanTreeRegion model 532 is responsible for 

discovery and modeling of the spanning tree topology in 
a single Subnet. This object listens for the 
CORRELATE_REG I ON message. This object knows that 
during initial creation of models, there are likely to 

35 be many CORRELATE__REGION messages sent to it in a short 
amount of time. To handle this, it does not start 
correlation as soon as it receives the message. It 
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starts a timer (30 seconds) , and only starts 
correlation once the timer expires . Each time a new 
message is sent, it restarts the timer. When the timer 
expires, it performs the correlation by sending the 
5 GET_SPANNING__TREE_DATA message to each region member to 
get spanning tree data, getting all port and MAC 
address info in the region, and then running its 
correlation algorithm. The correlation algorithm is a 
well-known algorithm in the industry. 

10 Modeling the spanning tree topology entails 

determining which ports in this region have a 
forwarding or blocking connection, creating a 
SpanTreeLink model 534, and relating the SpanTreeLink 
to each port 530 via the HasBorder relation. Each link 

15 is then related to the SpanTreeRegion via the 

RegionContainsRegion relation. The root bridge is also 
determined during correlation and stored in an 
attribute of the SpanTreeRegion model 532, 

When re-discovery occurs, the object will simply 

20 find all existing SpanTreeLink models, save them in a 
list, remove all HasBorder associations, and then 
perform the correlation again. It will reuse as many 
of the existing SpanTreeLink models as it can. If it 
runs out, it will create more. If there are some left 

25 over, it will destroy them. 

This object is responsible for detecting changes 
in the spanning tree topology, and triggering re- 
discovery. It does this by registering a timer to 
trigger it periodically. It reads the stpTopChanges 

30 MIB object of the root bridge to detect if a topology 
change has occurred. 

If a topology change has occurred, the object gets 
all of the members of the SpanTreeRegion, and sends 
each of them the DISCOVERJTABLES message to re-read 

35 their spanning tree tables. Then, the object sends the 
CORRELATE_REGION message to itself to rediscover the 
spanning tree topology. 
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The SpanTreeLink model 534 represents a logical, 
spanning tree connection between two ports 530 on two 
different devices which support the Spanning Tree Protocol. 
This object contains an attribute which stores whether the 
5 connection is in the ""forwarding" or ""blocking 11 state. 
The TransparentApp model 512 represents the 
device's ability to run and participate in the 
Transparent Bridging Protocol. The ability is modeled 
in the system by relating the device model 510 to the 

10 TransparentApp model 512 via the AppRunsOn 

relationship. When the app model is created by the 
""network protocol discovery 11 process (described in 
Fig. 3A) , the object will read the transparent bridging, 
forwarding database MIB tables of the device by sending 

15 the READ_TPFDB_TABLE message to itself. The message is 
used so it can be overridden by proprietary, vendor- 
specific apps. The object will then store the data. 

The object will then attempt to create a new 
BroadcastDomain model 504 and add the TransparentApp 

20 model 512 as a member. It will do this by first 

finding all of the Subnets of which the device is a 
member, then creating a new BroadcastDomain model 
inside each Subnet if none already exists. If one 
already exists, it will use that one. If the device is 

25 not yet a member of a Subnet (because some layer 3 
discovery has not yet run) , then it will watch the 
HasMember relation, and will continue when it becomes a 
member of a Subnet. 

When the device has been added as a member of a 

30 BroadcastDomain 504, the object will then send the 

CORREIiATE_REGION message to the BroadcastDomain to tell 
it to re-discover the BridgedLANs in that region. 
This object is also registered for the DISCOVER_TABLES 
message. When the message is received, the object will 

35 re-read the TpFdb tables (using the READ_TPFDB_TABLE 
message) and store the new data. 
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This object also registers for the GETJTPFDBJDATA 
message. When the message is received, the object 
returns the TpFdb MIB data stored. If none is stored 
yet, it will send the READ_TPFDB_TABLE message to get 
5 it from the device. 

This object registers for the READJTPFDB_TABLE 
message and reads the dotldTpFdb table when triggered. 
Other proprietary objects derived from this object 
override this message and get transparent bridging 

10 information from another vendor- specif ic source. 

The BroadcastDomain model 504 is responsible for 
discovery and modeling of the transparent bridging 
(physical) topology in a single Subnet 502. This 
object listens for the CORRELATE_REGION message. This 

15 object knows that during initial creation of models, 
there are likely to be many CORRELATE_REGI ON messages 
sent to it in a short amount of time. To handle this, 
it does not start correlation as soon as it receives 
the message. It starts a timer (30 seconds) , and only 

20 starts correlation once the timer expires. Each time a 
new message is sent, it restarts the timer. When the 
timer expires, it performs the correlation by sending 
the GET_TPFDB_DATA message to each region member to get 
TpFdb data, getting all port and MAC address info in 

25 the region, and then running its correlation algorithm. 
The correlation algorithm is a well-known 
algorithm in the industry and defined in an IETF RFC, 
as well as explained in numerous patents. Modeling the 
transparent bridging topology entails determining which 

30 ports 520 in this region are physically connected, 
creating a BridgedLAN model 506, and relating the 
BridgedLAN to each port via the HasBorder relation. 
Each link is then related to the BroadcastDomain 504 
via the RegionContainsRegion relation. 

35 When re-discovery occurs, the object will simply 

find all existing BridgedLAN models, save them in a 
list, remove all HasBorder associations, and then 
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perform the correlation again. It will reuse as many 
of the existing BridgedLAN models as it can. If it 
runs out, it will create more. If there are some left 
over, it will destroy them. 
5 The BridgedLAN model 506 represents a physical 

connection between two ports 520 on two different devices, 
at least one of which must support the Transparent Bridging 
Protocol. 

Furthermore, the determination of physical topology 
10 in computer networks between repeaters and bridges is 

described in U.S. Patent 5,297,138, Another apparatus and 
method for determining a computer network topology is 
described in U.S. Patent 5,727,157, which is incorporated by 
reference . 

15 Figs. 8 and 8A shows diagrammatically a rich multi- 

layer topological model of a routed network located within 
the communications network of Fig. 1 as stored inside the 
topology mapper 14. This multi-layer topological model can 
be understood also in connection with Fig. 6 (and Figs. 2 

20 and 2A) . The block diagram of Fig. 8 displays an autonomous 
system A/S 2 including an Area 0.0.0,0 (block 600) and 
containing a subnet 2,4/30 (shown in Fig. 6 as 305 and block 
610 in Fig. 8), a subnet 2.8/30 (block 615), a subnet 
2.12/30 (block 620), a subnet 2.16/30 (block 625), and a 

25 subnet 2.20/30 (block 630), all of which are shown in Fig. 6 
in Area 0.0.0.0. In Fig. 6, Area 0.0.0.0 is bounded by 
routers 302, 304, 306, and 308, and this relationship is 
shown in Fig. 8 by block 602, 604, 606, and 608, 
representing the corresponding peer routers. These routers 

30 are found by running the above -described algorithm. 

Furthermore, the block diagrams of Figs. 8 and 8A 
display autonomous system (i.e., Region A/S 2) the 
connection of Area 0.0.0.0 (block 600) to Area 0.0.0.1 
(block 640 in Fig. 8 A) connected by routers 302 and 352 

35 shown in Figs. 6 and 6A, respectively. 

The rich, multi- layer topological model can include 
three modeled topologies: Current, Historical, and Intended 



WO 01/86844 



PCT/US01/14767 



35 

Topology. The system can use different ways to represent 
multiple versions of the examined network topology. The NMS 
can create a whole new model of the topology after detecting 
a change in the topology. This embodiment, while time- 
5 consuming and memory intensive, makes servicing requests of 
historical topologies fast and easy. 

Alternately, the NMS can show relationships between 
network element models, and relationships between element 
models themselves. This is versioned via a start date and 

10 end date indicating the period of time during which the 

relationship existed (or during which an element was in the 
state represented) . The versioning of the port model is 
useful mainly for a critical metric change, such as 
bandwidth for a port. 

15 Preferably, the system performs servicing of the 

historical topology queries. This allows time and memory 
space savings when maintaining the historical model (but a 
difficulty is that the model has to find the relationships 
that were active during the specified time) . 

20 In general, to distinguish between intended topology 

and current topology, the system can construct multiple 
versions, or preferably show network model and relationships 
between network element models and add an ^intended" flag 
to the relationships between topology models. The system 

25 assumes, for the most part, that the modeling process finds 
intended network element, so there is usually no flag on the 
network element models. Fig. 8B shows diagrammatically an 
intended and current relationship. 

Fig. 8B depicts a bridge 660 with ports 0 and 7 (blocks 

30 662 and 664), a bridge 670 with ports 3 and 4 (blocks 672 
and 674) , and a bridge 680 with ports 0 and 7 (blocks 682 
and 684) . Status tables 666, 676, and 686 show the status of 
the respective connections 665, 675, and 685. 

Fig, 8B illustrates the spanning tree relationship 

35 between bridges 660, 670, and 680. For example, the 

connection between port 674 and port 684 was not present, or 
was blocked, when this subnet was originally modeled at 
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around 10:55 (see status tables 666 and 676). The other 
Spanning Tree connections 665 and 675 did exist at that 
time, and the intended flag is set to ""Yes 11 in status 
tables 666 and 676 indicating that this was the desired 
5 state of the network. 

Later, however, the Spanning Tree connection 665 has 
gone down, and a new Spanning Tree connection 685 has come 
up. As shown in status table 686, connection 685 was 
modeled at 11:32 (status table 666), and the operator has 

10 not indicated whether this connection is now intentional or 
not, so the intended flag is set to ""??? 1 1 ( ""unknown 1 1 ) . 
This scheme allows the model to keep a history of changes in 
the network, and to document which changes were intentional. 
This algorithm can be extended to give network elements 

15 an ""intended" status. In some environments, the above- 
described algorithms may discover network elements that are 
not supposed to be on the network. An operator would be 
allowed to validate each network entity model, as well as 
each connection model, thereby specifying which elements are 

20 not part of the intended design. 

Figs. 9, 9A, and 9B are flow diagrams that 
illustrate an algorithm for finding a change in network 
topology, for example, the network topology shown in Figs. 
8, 8A, and 8B. 

25 Fig. 9C is a flow diagram that illustrates an 

algorithm for validating the intended network topology a.fter 
finding changes. 

To be useful, a topological model must respond to 
change and reflect that topological change in the internal 

30 model. There are several aspects of handling topology change 
that impact performance of the network and of the topology 
model itself. Those aspects are monitoring of the network 
and the required update of the data representation. 

The topological model watches a representative element 

35 of the hierarchical network. Generally, a device maintains 
an internal topology of some sort of devices that are 
running the same protocol that it is running. The device 
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keeps an internal topology because it must make data 
forwarding decisions arid knowledge of the topology allows it 
to make an educated forwarding decision. When the device's 
topology agent determines that it must rerun its topology 
5 algorithm the device generally records this as a count or a 
time. The reasons for the algorithm needing to be exercised 
almost always relate to some aspect of topological change or 
some change that may result in forwarding choices being 
altered. A network topology server will monitor one device 

10 of the topology for a record of the topology algorithm being 
executed. As shown in Fig 9, block 902, the network 
topology server will be notified of the topology change by 
either polling the change count, being notified by the 
device, or some other means. This greatly reduces the impact 

15 the network monitoring has upon the production network. 

The second mechanism that the rich topology model 
employs to efficiently handle topology change is 
compartmentalization. The topology model is built upon the 
natural hierarchies of the represented network. When a 

20 topology change does occur, the change, generally, only 
effects entities within the hierarchical domain where the 
changing entity resides. The topology model only needs to 
be updated within this compartment. 

As shown in Fig 9, block 904, the network topology 

25 server will determine the hierarchical domain or region that 
needs to be updated. The region is determined by the 
protocol, which monitor point the change occurred upon, and 
the nature of the change. More than one region may need to 
be updated. For each region, the model is updated 910. For 

30 example, when a topology change is noticed, the change is 
particular to a certain protocol. The topology server may 
notice that a set of bridges "respanned" , that is, the 
Spanning Tree algorithm has run again, by polling the 
dotldStpTopChanges object of a bridge (via SNMP) . Or, we 

35 may get an OSPF specific trap from a router indicating that 
the router has a new peer. The algorithm to update the 
region is specific to the protocol that has caused a 
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change. The Spanning Tree region update is shown in Fig. 9 A 
and 9B. There are similar mechanisms for updating OSPF 
Areas, RIP domains, and other protocols. 

If the notification results in a change to the topology 
5 model (step 936) , then the topology server must determine if 
the change was intentional or not (steps 938, 940) . Step 
920 will be followed by step 922. 

A user of a topological network model may desire to 
reconstruct a historical topology. A topologically centric 

10 fault management system, that handles intermittent faults or 
network trends, could recreate the topology at any given 
time with a topological model that recorded topological 
change. The invention is capable of logging events of 
topology change. 

15 A topdlogy change could consist of a change in the 

status of a node, connection, service or network hierarchy. 
It is natural that the data recorded would be different for 
each type of change. The time of the change would be 
recorded in all instances. For example, in Fig 9A & B blocks 

20 928 & 918 show a timestamp being logged for both a link 
change and a device change. 

A change in a node would require logging the 
information identifying the node, any added or changed 
connections, any added or changed services and any added or 

25 changed hierarchical domains. A change in connection would 
require logging the changing endpoints. For example, in Fig 
9B, block 928 shows that the old model for a spanning tree 
link can be used to store the timestamp to log when the 
change occurred and block 930 shows a new link being created 

30 to capture the new model . A change of service may require 
the logging of possible new or changed connections. A 
hierarchical domain change could require itself and all 
inhabitants requiring recording in addition to connected 
domains . 

35 There are two types of change that occur within a 

network topology, planned and unplanned. Planned changes 
come in the form of adding network capacity with more 
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connections, or nodes to handle and increase in network 
traffic. Planned changes could be maintenance related 
resulting in foreseen network outages. Unplanned outages 
come in the form of accidental loss of some level of 
5 service. Unplanned outages could have originated in 

erroneous configuration, backhoe fiber cuts, CPU failure, 
etc . 

The invention incorporates a concept that hopes to 
capture the network designer/manager's intent in the design 

10 and management of the network. This concept is called, 

"Intent 1 . When the topological model is constructed, either 
through automation or manual process, we are capturing the 
network des igner /manager 1 s intent. When a change occurs, 
the change is marked as being of unknown intention (Fig. 9 

15 and 9A, Block 917, 930). The model is updated to reflect the 
new unintended topology and will service requests for the 
current active topology. After a change occurs and the 
model is updated, the network manager, administrator, or 
operator is queried whether this change should then be 

20 reflected in the official "intent 1 version of the network as 
shown in Fig. 9C. A list of all elements in the model with 
"unknown' intentions is displayed to the network manager, 
administrator, or operator (Block 946) . The user will then 
examine the change reported (Block 948) and will edit the 

25 intended state of the region or element (Block 952) . 

Fig. 10A and 10B are flow diagrams that illustrate 
an algorithm for finding a path to or from an unmodeled 
network entity in a modeled network region. The fault 
diagnosis engine 12 may request a Layer-2 or Layer-3 data 

30 path between two elements, one or both of which is not 
modeled in the topology mapper 14 (step 962) . If the IP 
address was not specified (conditional 964), we start by 
resolving the address via a name service, such as DNS (step 
966) . Once the IP address of the unmodeled entity is known, 

35 we search all the subnet regions (step 968) . Each subnet 
region has an IP address range, specified by subnet address 
and mask. For example, the subnet model 642 (Fig. 8A) 
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contains all IP address between 10.253.8.0 and 10.253.8.15. 
This range is displayed in the diagram in CIDR notation, as 
defined in the IETF RFC2096, 10.253.8.0/28. If we can't 
find a subnet with an IP address range that contains the 
5 unmodeled entity T s IP Address (conditional 970), the 
algorithm fails (step 972) . 

If we can find the subnet (condition 970) , we attempt 
to find the MAC address of the entity (976) . Generally, 
this is done by sending an SNMP request to the entity. 

10 However, there are other methods, such as initiating a proxy 
ARP on a router that borders that subnet. Note the ARP 
itself will not help if the entity is in a different subnet 
than the topology mapper host, because the topology mapper 
host's gateway will responds with the gateway 1 s MAC address, 

15 and not that of the unmodeled entity. If the MAC address 
can be determined (conditional 978) , we search the modeled 
bridges for a port that has this MAC address in its 
forwarding table (also known as Source Address Table or SAT) 
(step 981) . If we find such a bridge port (conditional 

20 982), we know that this unmodeled entity is logically placed 
in the LAW attached to this port, and we can return the path 
to this LAN (step 985) . Otherwise, we just return a path to 
the subnet (step 984) . 

Fig. 11 is a semantic data model used when 

25 constructing the rich multi-layer topological model shown in 
Figs. 8 and 8A. Semantic data model 1000 illustrates 
networking rules for relating network components to each 
other. The components include network elements, regions, 
and ports. The semantic data model also provides logical 

30 information useful for understanding the physical and 
logical connections embodied in the rich mult i- layer 
topology model . 

Semantic data model 1000 includes nodes, connections 
and services . A node is a physical or logical entity that 

35 serves as an aggregation point of a particular function. An 
example of such a physical entity is a chassis that has 
multiple repeated ports. An example of such a logical 
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entity is a VRRP virtual router that has multiple hot- 
standby physical routers. 

Connections 1020 can be either physical or logical 
communication interfaces between entities, wherein a 
5 connection can be one-to-one 1026, one- to-many 1028 and 
many- to-many . For example, a physical connection 1024 is a 
lOBaseT Ethernet cable plugged into two ports of separate 
Ethernet switches. A logical connection 1022 is an HTTP 
session between a web client and a web server. 

10 A service, or capability, is a description of 

functionality provided by an entity. An example is a DNS 
server running on a workstation 1010, or an OSPF routing 
application running on a routing device 1005. 

The modeling of the interdependencies of a network's 

15 collection of nodes, connections and services provide the 
rich, multi- layered topology to a model -based, topology 
centric fault management system (described in the co-pending 
PCT application (Docket ISTo. A3-03WO) entitled: Systems and 
Methods for Diagnosing Faults in Computer Networks, filed on 

20 7 May 2001, which is incorporated by reference) . 

Semantic data model 1000 also supports the notion of 
hierarchical regions, containment with exclusive membership 
and overlay with partial membership shown in Figs. 2, 2A, 6, 
8 and 8A. This notion is embodied by the Region 1032 and 

25 Region Border Element 1030 models. For example, a 

hierarchical region could come from an OSPF network, which 
network includes a region, areas within the region and 
subnets within the areas. Another example of containment 
with exclusive membership is an FDDI LAN, wherein all 

30 members of the FDDI LAN are contained within the next 
hierarchical domain, a subnet or VLAN. For example, an 
overlay with partial membership is an MPLS path over several 
routers running OSPF . These routers can each be a member of 
a different OSPF area. Not all members of each are-a will be 

35 members of the MPLS path. 

Network Elements 1002 such as devices may have certain 
configuration parameters 1040 which need to be modeled and 
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used for fault diagnosis . 

The rich, multi-layer topological model includes all 
such data about the logical and physical entities, 
connections, services and functions, and abstract 
5 containers. A topology centric fault management system can 
use the above-mentioned data to accurately and effectively 
diagnose a root cause fault and show impact of that fault 
and its symptoms. 

The above-described topology mapper 14 may be used 

10 separately or together with other units of network 

management system 10, described in connection with Fig. 1. 
Network management system 10 uses model data stored in 
topology mapper 14 for network management, and fault 
management process. The fault management includes a fault 

15 detection phase, a fault diagnosis phase, an impact analysis 
phase, a prioritization phase, a presentation phase, a 
recourse phase, and a resolution phase. This is a phased, 
component i zed, but interconnected process that can perform 
all aspects of fault management. 

20 A generic fault diagnosis engine, suitable for fault 

diagnosis in any system, is described in the co-pending PCT 
application (Docket No. A3-03WO) entitled: Systems and 
Methods for Managing and Analyzing Faults In Computer 
Networks, filed on 7 May 2001, which is incorporated by 

25 reference. 

To perform a path-dependent fault diagnosis, the 
generic fault diagnosis engine is applied to a path 
dependent algorithm that tests the network using a network 
model provided by topology mapper 14 . This path-dependent 

30 fault diagnosis is described in the co-pending PGT 

application (Docket No. A3-03WO) entitled: Systems and 
Methods for Diagnosing Faults in Computer Networks, filed on 
7 May 2001, which is incorporated by reference. 

Help desk system 18 (Fig. 1) is described in the co- . 

35 pending PCT application (docket no A3-05WO) entitled: Help 
Desk Systems and Methods for use with Communications 
Networks, filed on 7 May 2001, which is incorporated by 
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reference . 

Numerous other embodiments not described in detail 
here can apply the principles described to particular 
applications and are within the scope of the claims. 
5 What is claimed is: 
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CLAIMS 

1. The topology mapper, comprising 

a discovery module constructed and arranged to discover 
network elements in a communications network; 
5 a memory constructed and arranged to store topology 

data received from said discovery module; and 

a presentation module constructed and arranged to 
present data related to topology of said communication 
network . 

10 

2. The method of claim 1, wherein said discovery 
module is further constructed to determine logical 
dependencies between network layers of said communications 
network . 

15 3. The topology mapper of claim 2, wherein said 

discovery module is further constructed and arranged to 
determine dependencies between an application and a system 
in said communication network. 

4. The topology mapper of claim 2, wherein said 
20 discovery module is further constructed to determine 

topological change in said network element. 

5. The topology mapper of claim 1 further including a 
data log constructed and arranged to store information 
related over time to the function of said discovery module. 

25 6. The topology mapper of claim 1, wherein said 

presentation module is constructed and arranged to display 
historical topology data and current topology data. 

7 . The topology mapper of claim 1 , wherein said 
presentation module is constructed and arranged to display 
30 current topology data and intended topology data. 

8 . The topology mapper of claim 1, wherein said 
presentation module is constructed and arranged to display 
said topology data in a multi-layer format. 

9. The topology mapper of claim 8 constructed and 
35 arranged to display interdependencies among elements of two 
of said network layers. 
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10. The topology mapper of claim 1 constructed and 
arranged to display topological change for the purposes of 
historical reconstruction. 



5 11. A method for creating a multi-layer topological 

model, comprising the acts of: 

discovering network elements in a communications 
network; 

storing topology data received from said discovery 
10 module; and 

presenting data related to topology of said 
communication network. 

12. A method of providing topology data to a network 
15 management system, comprising the acts of: 

discovering network elements in a communications 
network; 

storing topology data received from said discovery 
module ; and 

20 presenting data related to topology of said 

communication network. 

13. A network management system including a topology 
mapper comprising: 

25 a discovery module constructed and arranged to discover 

network elements in a communications network; 

a memory constructed and arranged to store topology 
data received from said discovery module; and 

a presentation module constructed and arranged to present 
30 data related to topology of said communication network. 
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